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r T | Abstract. We review the statistical properties of the genealogies of a few models 

j, of evolution. In the asexual case, selection leads to coalescence times which grow 

p<| logarithmically with the size of the population in contrast with the linear growth of 

the neutral case. Moreover for a whole class of models, the statistics of the genealogies 
r-r^j are those of the Bolthausen Sznitman coalescent rather than the Kingman coalescent in 

p j the neutral case. For sexual reproduction, the time to reach the first common ancestors 

q to the whole population and the time for all individuals to have all their ancestors in 

• i-h common are also logarithmic in the neutral CcLSG, cLS predicted by Chang [T] . We discuss 
'"^ how these times are modified in a simple way of introducing selection. 

2 

T ~ J PACS numbers: 02.50.-r, 05.40.-a, 87.10.-Mn, 89.75.Hc 

1. Introduction 

in 

The genealogy of a population describes the relationships between all the ancestors of 
this population. Simple questions one may ask about the genealogy of a population are: 

How far one has to go into the past to find the most recent ancestor of two individuals? 

• i—i of m individuals? of the whole population? 

How do these times depend on the sample of m individuals chosen at random in the 
population? 

How do they depend on the size of the population? 

How do they fluctuate when the population evolves over many generations? 
How are they affected by the forces (like selection) acting on evolution? 

In the case of an asexual evolution, the ancestry of a population is a tree, the root of 
which is the most recent common ancestor of the whole population. In the neutral case 
(i.e. when all individuals have on average the same number of surviving offspring at the 
next generation), for a well mixed population, the height of the tree is proportional to 
the size of the population (see figure [T]) and its statistics are described by Kingman's 
coalescent [21 El IH E] (see section 2). 
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Figure 1. A typical genealogical tree in the neutral case, obtained by simulating 
the Wright-Fisher model for a population of N — 100 individuals. For the particular 
realization shown on the figure, the number of generations to reach the most recent 
common ancestor is 125, which is, as expected, of order N. Already for N = 100, all 
the visible nodes are coalescences of pairs of branches and one cannot see any multiple 
coalescences except at the very bottom of the figure where the number of branches is 
still of order N. 
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Figure 2. A typical genealogical tree in presence of selection for a population of size 
N = 100. The number of generations (10 for the realization of the exponential model 
shown on the figure) to reach the most recent common ancestor is much shorter than in 
the neutral case. In contrast to the neutral case, one can observe multiple coalescences 
even rather high in the tree. 



In a series of recent papers, together with A.H. Mueller and S. Munier E], we 
considered a family of models of evolution with selection. For these models, in contrast 
to the neutral case, the height of the tree grows logarithmically with the size of the 
population and, its shape (see figure [2]) is given asymptotically by the Bolthausen- 
Sznitman coalescent [S] rather than by Kingman's coalescent. 

In the case of sexual evolution, each individual has two parents, four grand parents, 
and so on. Each individual is therefore the root of what looks like a tree for the first 
generations in the past. Going further into the past, however, the branches of this 
tree start to merge (see figure [3]) and the number of distinct ancestors do not grow 
exponentially anymore [H E]. The number of ancestors then saturates at a value which 
is a fraction of the whole population living in this remote past (the rest of this past 
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Figure 3. The "tree" of ancestors in the case of sexual reproduction. After a few 
generations (of order the log of the size of the population), the branches start to merge 
and the total number of ancestors saturates (left figure). Furthermore, comparing the 
ancestries of two individuals (right figure) , all their ancestors become identical. 

population consists of all those individuals who had no offspring or whose lineage became 
extinct). Comparing the ancestry of two individuals of the same generation, one sees 
two growing binary trees in the recent past which then start to intermix in a more 
remote past until they become identical. 

Here we try to review a few properties of the genealogies of some simple models of 
evolving populations, both in the neutral case and in the case of selection. In section 2, 
we recall a few properties of the Kingman coalescent and of the Bolthausen-Sznitman 
coalescent, as special cases of coalescent processes and why neutral evolution in the 
asexual case leads to the Kingman coalescent, in the limit of a large population. In 
section 3, we describe the properties of the exponential model, an exactly soluble of 
asexual evolution in presence of selection, and show that its trees follow the statistics 
of the Bolthausen-Sznitman coalescent. In section 4, we show that, like the exponential 
model, more generic models of evolution with selection converge to the Bolthausen- 
Sznitman coalescent. Section 5 is devoted to the case of sexual evolution both in the 
neutral case and in presence of selection. 

2. How to quantify genealogies in the case of asexual reproduction 

To study the statistical properties of the trees generated by sexual reproduction, one 
can adopt several points of view: 

(i) One can try to see how the population can be partitioned into r-families, with 
the rule that two individuals belong to the same r-family if and only if their most 
recent common ancestor is at a distance less than r generations in the past. These 
r-families can themselves be divided into subfamilies by choosing a shorter number 
of generations r', and so on. 

(ii) One can alternatively study the genealogical tree as a dynamical process and try 
to determine the rates qt at which k branches merge into one when one moves from 
the bottom to the top of the tree {i.e. when one looks at the tree backwards in 
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time). These rates might be correlated in time or depend on the characteristics of 
the individuals along the branches which merge. 

(iii) In our previous works [HI EJ HU] we chose to compute for these random trees the 
coalescence times T p defined as follows: T p is the age of the most recent common 
ancestor of p individuals chosen at random in the population. In general the times 
T p depend both on the sample of p individuals chosen and on the generation. 
One advantage of characterizing the random trees by the times T p is that they 
are relatively easy to average (over the samples of p individuals and over the 
generations) in simulations. 

For some models, such as coalescence processes discussed below, one can find explicit 
mathematical formulae which relate these various properties [10J. 



2.1. Coalescence models 

Coalescent processes give a simple procedure to generate a whole class of random trees 
[TT| IT2"} IT5j IT4"] . In a continuous time version, a coalescent process is a dynamical 
stochastic process, where any set of k individuals have a probability q^dt of coalescing 
into one individual during the infinitesimal time interval dt, implying that, if the number 
of branches is b at time t, the probability rf ) (b')dt of having b' branches at time t + dt 
(with b' < b) is given by [TO] 

rhih,) = % FMFl^ H)Vm ' (1) 

From the rates qk, one can easily calculate the times (T p ) (averaged over all the 
realizations of the coalescence process) by analyzing what happens in a steady state 
situation during an infinitesimal time interval dt 

(T p ) = dt+ (T p ) (l-dtJ2 r p (b')) +dt J2 r p (V)(T v ) + O (dt 2 ) . (2) 

\ b'<p J 2<b'<p 

One then gets 

1 (T 3 > Aq 2 - 3q 3 



(T 2 ) 

<h (T 2 ) 3g 2 -2q 3 

(3) 

(T 4 ) = 27q 2 2 - 56q 2 q 3 + 28gf + 12g 2 g 4 - 10g 3 g4 
(T 2 ) (3q 2 - 2q 3 )(6q 2 - 8q 3 + 3g 4 ) 

So q 2 determines the time scale of the times T p and all the ratios (T p ) / (T 2 ) are given by 
ratios of the coalescing rates qk- 

Remark: an alternative way of thinking of the coalescence processes (121 El ITB] 
defined above is to say that during every infinitesimal time interval dt, there is a 
probability p(f)dfdt that a fraction / of all the branches coalesces into a single 
branch, all the other branches remaining unchanged. Then for k given individuals, 
the probability that their k branches merge during dt is given by 

q k dt= f 1 f k p(f)df dt, (4) 
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while the probability rb(b')dt that b distinct branches becomes b' branches during the 
time interval dt is 

and one recovers Q. 

Two special cases will be of interest in what follows: 

(i) The Kingman coalescent 

In the Kingman coalescent, only q 2 ^ and all the qu = for k > 3. Then (|3]) 
becomes 

, T v I Os) = 4 m) = 3 

^ 2; 92 ' (T 2 > 3' (T 2 > 2' W 

In fact all the correlation functions between these times T p can be computed 
PH El 1121 HZj. In particular one gets (T P )/(T 2 ) = 2 -2/ p. 

(ii) T/ie Bolthausen-Sznitman coalescent 

The Bolthausen-Sznitman coalescent jS] was introduced in the context of the mean 
field theory of spin glasses jTSJ UJJ] to represent the tree structure of the pure states 
[20, 21J predicted by the replica scheme invented by Parisi [221E21EI]- In terms of 
the rates g&, the Bolthausen-Sznitman coalescent can be defined by 



leading to 



/ T v = I (M= 5 _ m) = 25 

? 2 ' (T 2 > 4' <T 2 > 18' 18 j 



A natural question, then, is to know whether a given model of evolution (with or 
without selection) gives rise to random trees which can be described by a particular 
coalescent process [TJ 1 H31 171 H51 ITU] . 



2.2. The Wright- Fisher model 

The Wright-Fisher model [UJ ESI El EE] is one of the simplest and most studied models 
of evolution in the neutral case. In its simplest version, it describes a population of fixed 
size N. At every generation g, the parent of each individual is chosen uniformly among 
the N individuals living at the previous generation. It is easy to see that the probability 
qk that k individuals have the same parent at the previous generation is q^ = N 1 ~ k , so 
that qt q2 for k > 2 and large N. Moreover the probability of seeing more than one 
coalescence event among k individuals (with k of order 1) at a given generation becomes 
also much smaller than q 2 for large N. Therefore in the limit of a large population, the 
time (T 2 ) = q 2 l scales like iV 

(T 2 ) ~ N (9) 
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the ratios qu/qi — > for k > 3, and the statistics of the trees are given by the Kingman 
coalescent. 

The Kingman coalescent is particularly central in the theory of neutral evolution 
because it is universal [2SJ EH EEU El ITT] : one can change the rules in the definition 
of the Wright-Fisher model in many ways and in the limit of a large population, one 
always recovers the Kingman coalescent. For example one may choose the parent i 
in the previous generation with a non uniform probability Pi and as long as these Pi 
decay fast enough with the size N of the population, one has in the large N limit 
Ik/12 = (J2iPi)/(J2i.p'i) — > for k > 3 and one recovers the Kingman coalescent 



Remark 1: Starting from the random tree structure of the Kingman coalescent, and 
assuming mutations arising at constant rate along the branches of these trees, one can 
predict the statistical properties of the genetic diversity J2B1 EH EDI EH E2J E31 EI]- 

Remark 2: Even in the large N limit, the shape of the tree and the times T p fluc- 
tuate when one follows the same population over many generations. The correlations 
between these times at different generations can be computed [321 E] ■ These fluctuating 
shapes of the trees are very reminiscent of the tree structure of pure states predicted by 
the mean field theory of spin glasses [201 EU E21 EE] • 

Remark 3: One can define a finite dimensional version of the Wright-Fisher model 
by considering that the individuals of the population are at the nodes of a lattice of 
L d sites with periodic boundary conditions in dimension d and that each individual has 
its parent chosen at random among all the sites at a distance less than I with I <C L. 
[3T1 EH]- In dimension d > 2, the genealogies of such populations have their statistics 
still given by the Kingman coalesent (the problem can be formulated as a reaction diffu- 
sion probleme whose upper critical dimension is 2). On the other hand, in dimension 1, 
the statistical properties of the trees are modified (they can be understood in terms of 
coalescing random walks) and ^ is then replaced by [3H] 

it\ at 2 7 ^ 8 nm 

(T2HiV ' (^) = 5' (^) = 5- (10) 

Note that in this case the spatial aspect is crucial, and there is no choice of the qk for 
which the one dimensional problem could be reduced to a coalescent process as defined 



in section (2.1): for all the coalescent processes, the age T2 of the most recent common 
ancestor of two individuals has an exponential distribution implying that (T|)/ (T 2 ) 2 = 2 
whereas [30], for the finite dimensional model in d — 1, this ratio takes the value 12/5. 
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3. The exponential model: an exactly soluble model of evolution with 
selection 

3.1. Definition 

The exponential model [51 OH] is a simple generalization of the Wright-Fisher model, 
which includes the effect of selection. As in the Wright-Fisher model the size N of 
the population remains the same at every generation. All the individuals at a given 
generation are however not equivalent: each individual i at generation g carries a value 
Xi(g) which represents a trait (or a fitness in the sense used by Bak and Sneppen [10]). 
There are then two steps to go from generation g to generation g + 1 : 

The reproduction step: each individual has its offspring generated by a Poisson 
process of density e~^ x ~ Xi ^\ This simply means that, with probability e~^ y ~ Xi ^'dy, 
there is an offspring of Xi(g) in the infinitesimal interval (y,y + dy). Repeating the 
procedure for all individuals i at generation g, one gets that way an infinite number of 
offspring along the real axis, with many offspring for large negative y and no offspring 
for sufficiently large positive y (note that at the right of any point y on the real axis 
there are always a finite number of offspring). 

The selection step: at generation g + 1, one keeps only the N rightmost points 
among all these offspring. 



The simplicity of the exponential model comes from the following identity 



E < 

Ki<N 



-(x-Xi(g)) 



-(x-X g ) 



with 



X„ 



In 



E ■ 

Ki<N 



This means that the offspring of all the individuals at generation g can be generated by 
a single Poisson process centered at position X g . So X g is the only information about 
generation g needed to generate the next generation. A simple consequence is that the 
successive shifts X g+1 —X g of the position of the population are i.i.d. variables. Another 
advantage of the exponential model is that the population at generation g consists of 



the N rightmost points of a single Poisson process (11) centered at X g . One can then 



show [TO] that the population at generation g + 1 can be generated in the following way: 

x i {g + l)=X g + Y g+1 + y i {g + i), (12) 

where Y g+ i and the N variables yi(g + 1) are independent random variables distributed 
according to 



Prob(F < Y g+1 <Y + dY) 
Prob(y < yi(g + 1) < y + dy) 



1 



exp 



■(N + 1)Y -e 



-Y 



dY, 



e y dy for y > 0, 







for y < 0. 



(13) 



The positions {xi(g)} form a cloud of points which remain grouped as g increases. One 



could locate this cloud of points by its center of mass X 



CM. 



As the points remain 
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grouped, one can as well use the position X g denned in (11) which has the advantage 
that the differences 



X, 



9+1 



X„ = Y q+1 + In 



E 



3 2/«(s+l) 



are uncorrelated random variables. It is easy to see that the difference 

1 



V yCM. 
X 9 ~ X g 



In 



E 



iV 



(14) 



(15) 



has a distribution independent of g, so that all the cumulants of Xg and of X g have 
the same linear growth with g. 

For example the speed of adaptation vn (which is simply the velocity of the cloud 
of points along the real axis) is given by 



v N 



lim 



lim 

g-s>oo 



(X g - X g ^) = (Y g ) + In 



E^ 



yi(g) 



(16) 



g^oo g g^-oo g 

Similarly the generating function G^{(3) of all the cumulants of the positions X g or 

c. 

9 



X^ M - can be computed by 



Gs(fi) = lim 1 °<^ = lim ^ 

g-s-oo g g-s-oo g 

= ln<e-^^«- x «- l )> = ln(e-^) + In 



E 



(17) 



The average over Y g and the 2/i(<?)'s (distributed according to ( 13 )) can be performed 
leading to the following large N behavior [51 E] 

G N {p) ~ -/31nlniV- ( (3 In In X + /3T'(1) + (5 - (3 ^ f! + ) + 



In X 



r(i + /3) 



This gives for large iV 

v N 



lim W 



. . Ar lnlniV + 1 
In In N H h 



IniV 



(X k g ) c 



lim 

g^oo g 



k\ 

hTiV 



E 



+ ... 



(19) 



(20) 



This logarithmic growth of the velocity with the size of the population seems to be 
common to many models of evolution with selection on a smooth landscape [4"T1 |4"2"| H5] 
where each indivdual has a number of offspring proportional to the exponential of its 
fitness. 



3.2. The genealogies in the exponential model 

In the exponential model, given that at generation g + 1 there is an individual at position 
x, the probability Pi(x,g + 1) that this individual is the offspring of the z-th individual 
at generation g is given by 



Pi(x,g + 1) 



-(x-xi(g)) e xi(g) e Vi(g) 

-(x-Xjig)) £\ e a:j(ff) £\ e 2/j(s)' 



(21) 
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The simplicity of the exponential model comes from the fact that these Pi do not depend 
on x and that they can be expressed in terms of the Ui{g) only, implying that the Pi at 
different generations are uncorrelated. 

The probability qk that k individuals have a single common ancestor at the previous 
generation is then given by 

/ ^2ie kyi ^ \ 



qk 



(22) 



where the average is over the N random variables yi(g) distributed according to (13). 
For large N these averages can be computed [10J 

qk * (k -1) In N - (23) 
Moreover, at each generation, the probability of more than one coalescence event among 
a fixed number of individuals becomes negligible in the large N limit [10J. Therefore the 
exponential model converges to the Bolthausen-Sznitman coalescent <Q with coalescence 
times given by (|8| and 

(T 2 ) ~ In N. (24) 



3.3. Conditionning on the speed 

The shapes and the heights of the trees fluctuate with g. In [TU] we tried to determine 
how these fluctuations are correlated to the displacement X g — X . To do so we assigned 
to each evolution event a weight e~^ Xg . This means that, if Tk(g) is the age of the most 
recent ancestor averaged over all the choices of k individuals at generation g (for one 
realization of the process), we tried to compute weighted averages such as 

1 



(Tkh 



lim - 
9 ^°° 9 JZ'i 



(25) 



One can then show [TU] that, with these weights, the probability that k individuals at 
generation g + 1 have their most recent common ancestor at the previous generation is 

—L. (26) 



Qk 



E 7 - e^' (9) 



As for (3 = 0, for large N, the probability of observing more than one coalescence 



(27) 



event at each generation becomes negligible and (26) gives to leading order [TO] 
1 (fc-2)!r(/3 + l) 
qk ~\nN T(f3 + k) 
Therefore for these biased events, the trees have the same statistics as a coalescent whose 
rates are given by (27). This allows one to determine through ^ the coalescence times 
(T p ) (see [TU] for explicit expressions). 



Varying in (27) we see that, conditioning on the displacement X g , the statistics 
of the trees interpolate between the Bolthausen-Sznitman coalescent for /3 = (no bias) 
and the Kingman coalescent for — > oo (bias toward slow velocities). 
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Fitness 



Figure 4. The perfect selection case: at each generation one selects the N (here 
N = 3) rightmost offspring produced by the previous generation. 

4. More generic models of evolution with selection 

In P, El ETU] , we considered a whole family of models of evolution with selection which 
generalize the exponential model. These more generic models are defined as follows: as 
in the exponential model, the population has a fixed size N and each individual % at 
generation g is characterized by a real number (a trait) Xi(g). At the next generation, 
the trait Xi(g + 1) of an offspring i is inherited from its parent P(i,g + 1) up to a random 
number Zi(g + 1) drawn from a fixed distribution p(z) which represents the effect of the 
mutations on the trait Xt 

Xi(g + 1) = x P{ i >g+1) {g) + Zi{g + 1). (28) 

Then comes the selection step where only N survivors among all the offspring produced 
by generation g + 1 are chosen according to their trait Xi{g + 1). 

The model [5J Ej depends on the number of offspring of each individual, on the 
distribution p(z), and on the way the N survivors at generation g + 1 are chosen among 
all the offspring of generation g. 

Here we present the result of numerical simulations of three versions of the model, 
where we always took for p(z) a flat distribution: 

• The perfect selection case: in this version (see figure [i]), each individual at 
generation g has two offspring, but out of the 2N resulting individuals we only 
keep at generation g + 1 the N rightmost ones i.e. those who have the highest 
Xi(g + 1). 

• The fuzzy selection case: here again, each individual has two offspring at the next 
generation, but out of the 2N resulting individuals, the N survivors at generation 
g + 1 are chosen uniformly among the 3N/2 rightmost ones. 

• The two parent selection case: in this version, for each individual at generation 
g + 1, instead of choosing its parent uniformly in the population at generation g 
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as in the Wright-Fisher model, we choose two potential parents uniformly in the 
population and keep as the actual parent the best one among these two [33]. (It is 
equivalent to state that the i-th best individual has on average 2(iV — i)/(N — 1) 
offspring and then draw the new generation with a multinomial distribution.) 

We measured the average coalescence times (T 2 ), (T 3 ) and (T 4 ). The time (T 2 ) 
seems to increase like In 3 N (see figure |5h and the ratios of these times seem to converge 
to the values predicted for the Bolthausen-Snitzman coalescent in the large N limit, 
(see figure [6]) . 

0.70 

0.60 

0.50 

i^Lo.40 
In N 

0.30 
0.20 
0.10 

oo 10 5 10 4 10 3 10 2 

N 

Figure 5. (T 2 )/ln 3 iV as a function of N for the three models. The scale on the 
horizontal axis is 1/ln 2 iV. 
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The above numerical simulations and additional ones (some on larger scales) 
[51 El EU I3S] indicate that for this whole family of models the statistics of the genealogical 
trees converge, in the large N limit, to those of the Bolthausen-Sznitman coalescent fl8J) 
as for the the exponential model, the only difference being that the coalescence times 
(T 2 ) grow like 

(T 2 ) ~ In 3 N. (29) 



instead of In N for the exponential model (24). 

At each generation, the N values Xi(g) form a cloud of points which moves along 
the real axis as g increases. This motion is stochastic and we argued in U\ that 
its evolution can be related, for large N, to that of a noisy Fisher-KPP equation in 
the weak noise limit. In our joint works with A.H. Mueller and S. Munier [HI U\ we 
used a phenomenological theory [32] developed for such travelling wave equations to 
explain the convergence to the Bolthausen-Snitzman coalescent and the In 3 N timescale. 
From this phenomenological theory the following picture emerges: the evolution of the 
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Figure 6. (T 4 )/(T 2 ) (up) and (T 3 )/(T 2 > (down) as a function of N for the three models. 
The dotted lines are the values for the Bolthausen-Snitzman coalescent, respectively 
25/18 and 5/4. The scale on the horizontal axis is 

l/ln 2 7V. 



cloud of points (the Xi(g)) is most of the time deterministic. At time intervals of order 
In 3 A, rare fluctuations occur where the best individual gets an exceptionnally good 
trait Xi(g). These fluctuations relax on a time scale of order In 2 N generations. During 
this relaxation time, the individuals with the exceptionnally good trait have their long 
time lineage less affected by selection than the rest of the population and the net effect 
is that their lineage replaces a positive fraction / of the whole population as in Q; 
the value of / depends on the size of the fluctuation. An analysis of the distribution 
of these rare fluctuations leads to a distribution of /, in Q, which corresponds to the 
Bolthausen-Snitzman coalescent. So the picture resembles that of the model considered 
by Durrett and Schweinsberg [14J where the effect of selection ends up giving rise to a 
coalescent with multiple collisions. 

It might look paradoxical to get the Bolthausen-Sznitman coalescent, with multiple 
branches coalescing at the same point, in situations like the perfect selection case, where 
each individual has only two offspring. This is simply because coalescences of pairs 
at nearby generations look like multiple coalescences on a time scale of order In 3 N. 
Figure [7] illustrates this mechanism. We simulated the model with perfect selection for 
a population of size iV = 10 4 and, at an arbitrary time, we plotted the coalescing tree of 
the ancestries of 100 individuals chosen at random. While a close look makes it apparent 
that there are only coalescences of pairs, one can see several regions where coalescences 
of pairs occur in quick succession, thus resulting into the quasi-instantaneous coalescence 
of more than two lines [2] . Note also that during the In 3 N generations (in real time) 
needed for a fluctuation to occur, the ancestral lines of the individuals diffuse within the 
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Figure 7. A realization of the genealogical tree of 100 individuals chosen at random 
among a population of 10 in the model with "perfect selection". 

whole population. As the size of this cloud is of order In iV along the real axis, it takes 
In 2 N time steps to explore the whole population and by the time the next fluctuation 
occurs, these lines have swept many times over the whole population. This explains 
the perharps counter-intuitive observation that the average time of coalescence between 
several individuals is independent of the positions of these individuals. 

5. Sexual reproduction 

In the case of sexual reproduction, each individual has two parents in the previous 
generation. When the population is large, its number n g of distinct ancestors, g 
generations in the past, starts to grow like 2 g . For any finite population, however, this 
number n g of distinct ancestors has to saturate and the branches of the genealogical 
tree have to merge if one goes far enough into the past (see figure [3|. Looking at a given 
model of evolution, with sexual reproduction, for a population of fixed size, one may 
wonder : 

(i) How does n g depend on gl 

(ii) When does the saturation occur? 

(iii) How far one has to go into the past to find some common ancestors to a group of 
p individuals? 

(iv) When does a group of p individuals have all their ancestors in common? 
5.1. The neutral case 

In the neutral case, all these questions have simple answers. For a simple model discussed 
below, for example, Chang [U HB] has shown that the number generations Gn for all 
individuals to have at least one common ancestor is (for large N) 
\nN 

G N ~ — ~ 1.44 In AT, (30) 
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while the number generations Gn needed for the whole population to have all their 
ancestors in common is 

( 1 1 



G 



N 



\nN ~ 2.55 In N, 



(31) 



,ln2 ln(2(l-x*)) / 

where x* ~ 0.8 is the non zero solution of x* = 1 — exp(— 2x*). 

The simplest model of neutral evolution to describe the genealogies in the case 
of sexual reproduction is to take a population of fixed size N and to assign to each 
individual two parents chosen at random in the previous generation (for simplicity we 
won't make any distinction between males and females; moreover, the two parents of 
each individual being chosen independently in the previous generation, the model allows 
these two parents to coincide a with probability 1/N. All these simplifications are in 
fact unimportant as they do not affect the large A" behavior ( 30pT )). In such a model, 
the number n g of distinct ancestors of a given individual, g generations ago in the past, 
is a Markov process. The distribution of n g+ i, given n g , can be written but it leads to 
a rather complicated formula which is in fact not that useful to understand the large N 
behavior of the model. Instead one can calculate the first moments of n g+ \ by writing 
that 



N 



N 
i=l 



where yi = if individual i at generation g + 1 
one of the n g ancestors at generation g and y\ 
offspring among the n g ancestors at generation 

■y \ 2n g / 2 \ 2n 



(32) 

(in the past) is the ancestor of at least 
= 1 otherwise (i.e. %)i — 1 if i has no 
/). For i,j, k distinct, 



(Vi) 



N 



(ViVj) 



1 



N 



{yiVjyk) 



2n a 



etc. It is then easy to calculate the first moments of n g+ i 

1 \ 2n 9 



K+i> = n 



(n 2 g+1 ) - (n 



) 2 



N 1- 



1 - 
1 

N 



2n a 



n) 



+ N(N-l)(l- 



N 



2n a 



N ( 1 



1 

N 



■hi a 



For large N, one can see that 



in 



3+1/ 



) ~ AT I 1 — exp 



2n, 



and that 



(n 2 g+1 ) - (n g+i y 



A^exp 



2n, 



N 



N 



— (N + 2n g ) exp 



4n n 



N 



for n g ~ 



(33) 

(34) 
(35) 

(36) 
(37) 



N 



for n g N. 



These expressions show that, for large N, the fluctuations of n g+ i are small compared to 
the average (n g+ i) and the evolution of n g is very well approximated by a deterministic 
evolution 

and n = 1, with F{x) = 1 - e~' 2x . (38) 



n 



9+1 



NF 



9 

N 
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As long as n g /N <C 1, the function F(x) can be approximated by a linear function 
F(x) ~ 2x, and one has n g ~ 2 9 for g IniV. On the other hand, in the long time 
limit, n g — > Nx* where x* ~ 0.8 is the attractive fixed point of the map x — > F(x). The 
meaning of x* is simply the fraction of the population whose lineage does not become 
extinct after many generations, while 1 — x* is the fraction of the population whose 
lineage becomes extinct. 

The map x — > F allows one to determine several other properties of the genealogies. 
For example if one tries to compare the genealogies of k individuals and calls n g (k) the 
total number of distinct ancestors of at least one of these k individuals at generation g 



in the past, it is clear that up to a change of initial condition n g (k) evolves as n g in (38): 



n g+l {k) = NF {^-\ with n (k) = k. (39) 

Again, as long as g <C In N, one finds n g (k) = k 2 9 and the ancestors of the k individuals 
are all distinct. On the other hand, all the n g (k) converge to the same value Nx* meaning 
that all ancestors become common to the whole population. 

The number of ancestors m g (2) common to 2 individuals at generation g in the 
past can be written as 

m g (2) = 2n g (l)-n g (2). (40) 

More generally the number of ancestors m g (k) common to k individuals at generation g 

k kl 

^)=£ ipR ^(-irsM (4i) 



For N large and k <C N, one can expand the solution of (39) in powers of 1/N to 

get 

1.2 p 

n g (k) = k 2 9 - — [2 29 - 2 9 ) + — (8 x 2 39 - 18 x 2 2g + 10 x 2 9 ) + ■ • • (42) 



Using this expansion in (41 ), one gets that, as long as k 2 9 -C N, 



nkg 

™ 9 ( k ) - tfi=T ( 43 ) 

By requiring that m g (p) ~ 1, one then obtains the number g^ of generations needed to 
find at least one common ancestor to k individuals 
k - 1 In N 

* K nr¥2 ■ (44) 

This expression agrees well with simulations performed by Stephane Munier |47j . As 



the size k increases, it converges to Chang's expression (30). 



For 2 9 ~ N, the expansion (42) can be rewritten to leading order as a scaling 
function 

/ k 2 9 \ 8 46 

nJk) ~ NH [ with H(x) = x - x 2 + -x 3 x 4 H (45) 

V N ) 9 63 
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where H(x) is the solution of 

H(2x) = F[H(x)] 

which starts at x = as H(x) ■ 
see from (46) that H(x) — > x* . 



(46) 



x and where F is given in (38). For large x, one can 



the attractive fixed point of the map x — > F(x), and 
linearizing the map (38) around this fixed point one gets that for large x 

ln(2(l 



H{x) 



x 



B 



lnx 
m~2 



x 



where 



a 



In 2 



1.3, 



(47) 



and where B is a periodic function B(x) = B(x + 1) of period 1. 

To find the number of generations g\ for a group of k individuals to have all their 
ancestors in common, one should write that n g (k) — m g (k) = 0. Here, as we use 
deterministic equations, the difference n g (k)—m g (k) tends exponentially to zero without 
ever reaching it, so we replace this condition by n g (k) — m g (k) ~ 1 arguing that the 
stochastic nature of the evolution of n g (k) and m g (k) should make this difference vanish 
quickly after it has become of order 1. From (45J47J), 



n g (k) ~ NH 



'k 2 g> 



N 



and one then gets (41) for m g (k) 



m g (k) ~ N 



N c 



E 



N a 1 
x* —B 

2ga fca 



k\ 



'In k -In N " 
ln~2 



1 ga frj - k) 




(4£ 



(49) 



Then, as the function B is periodic, the coefficient to N a remains bounded in ( pp9| ) 
as N becomes large and one gets that n g (k) — m g (k) is N a+1 /2 ga times a number of 
order 1. Therefore, one gets for g\: 



9k 



[l + a) In TV 
aln2 



1 
ln2 



1 



ln(2(l -x*)) 



In N. 



(50) 



We see that for large N, the times do not depend on k to leading order in N and 
that the expression agrees with Chang's prediction (31). 



5.2. An attempt to include selection 

One can imagine various ways of introducing selection in a model with sexual 
reproduction. In an attempt to do so, we consider the same model as in the neutral case: 
a population of fixed size N, each individual having its two parents chosen at random 
in the previous generation. Then we introduce selection by saying that each parent i 
is chosen with a probability pi. We assume that these pi are all of order 1/N, that 
their distribution remains the same at all generations, and that there is no correlation 
between the pi of an individual and the p^s of its parents (the selective advantage is not 
inheritable). All the analysis of the neutral case can be extended to this case, the only 
change being that the function F(x) defined in ( |38| ) becomes 
1 N 

F = -^£(l-e- 2 ^) with J> = 1- (51) 
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Figure 8. The time Gn for the whole population to have all their ancestors in 
common versus 5 when the pi take two values: (1/(1 + AS) with probability 1 — 5 and 
5/(1 + A5) with probability 5. We see that the effect of selection (here having two 
possible values of the pi is to increase the time Gjv while in the limit 5 = or 5 = 1 



one recovers Chang's value (31 1 



As the function F is modified, all the properties which depend on the precise shape of 
F, such as the fixed point x*, are modified. In particular ( 3l|50 ) become 

1 1 



9k 



G 



N 



In 2 InF' 



.7, 



In N, 



(52) 



while the times gk to find at least one common ancestor remain unchanged (44). 

For a particular choice of the pi where pi = 5/(1 + 46) /N for i < N6 and 
Pi = 1/(1 + 46) /N for i > N6 we show the § dependence of Gn on figure |ij We 
see that the time for the whole population to have all their ancetors in common is 
modified by selection and that, at least for the particular distribution considered here, 
selection has the effect of increasing the time g~k ~ Gn- This is somewhat surprising as 
it goes in opposite direction of what we saw in the case of asexual evolution where the 
effect of selection was to shorten the coalescence times. 



6. Conclusion 



Here we have reviewed a series of recent results on simple models of evolution with and 
without selection. 

In the case of asexual evolution, at least for the models we considered here in 
sections 3 and 4, the effect of selection is to make the coalescence times T p (=the age 
of the most recent common ancestor to p individuals) grow logarithmically 
the size of the population in contrast to the linear growth of the (neutral) Wright-Fisher 



24p9) with 
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model (section 2.2). Moreover the statistics of the genealogies are modified by selection 
and seem to be always given by the Bolthausen-Sznitman coalescent Q. 

Apart for the exponential model, for which a full mathematical treatment is possible 
(see section 3 and [5J [7J HO]), a detailed theory, showing that the generic models of 
section 4 lead also to the Bolthausen-Sznitman coalescent, is still missing. Recent 
mathematical works, however, where selection is replaced by an absorbing wall moving 
at a constant velocity jJSJ SHJ ED] or with a more complicated dynamics [51J to keep the 
size of the population bounded have shown that the Bolthausen-Sznitman coalescent 
does give the tree statistics one should see for this whole class of models. 

When looking at the N values Xi{g) as a cloud of points moving along the fitness 
axis, all the models in section 4 lead to a motion of this cloud of points very related 
to the motion of travelling waves of the noisy Fisher-KPP equation. In fact the same 
phenomenological theory [I5J E] which allows one to unsderstand the fluctuations of the 
position of these travelling waves can be used for the models of section 4 and leads to 
the Bolthausen-Sznitman statistics of the genealogies of these models. 

Other models of evolution with selection on a smooth fitness landscape have been 
studied in the past (HI H2J [43] . In these models each individual has an average number of 
offspring proportional to the exponential of its fitness (very much like in the exponential 
model of section [3]) giving a huge advantage to the leaders. Obviously, an interesting 
open question would be to investigate their genealogies in order to see whether these 
other models lead to new statistics or if they belong to the Bolthausen-Sznitman 
universality class. Whether other ways of introducing selection (rugged landscapes |52j , 
competition between a few alleles [53J) could lead to a few number of other universality 
classes is also an interesting question to investigate. 

In the sexual case, we have recalled the main results due to Chang pQ, on the 
numbers of generations needed to find at least one common ancestor or all common 
ancestors to the whole population in the neutral case. In an attempt to include selection, 
we have seen how these times are modified, with the rather counter-intuitive result that 
selection might increase the number of generations to have all ancestors common to 
the whole population (in contrast to what happens in the asexual case where this time 
decreases due to selection). Of course a more exhaustive study of the effect of selection 
on the genealogies, in the case of sexual reproduction, would be suitable in particular 
when one includes recombination J54J ESI EE] • 
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